Explaining dimensionality reduction results using Shapley values
نویسندگان
چکیده
• Clusters of dimensionality reduction results are interpreted using Shapley values. Summary visualizations convey the decisions processes. values can reveal ubiquitous information medical datasets. Dimensionality (DR) techniques have been consistently supporting high-dimensional data analysis in various applications. Besides patterns uncovered by these techniques, interpretation DR based on each feature’s contribution to low-dimensional representation supports new finds through exploratory analysis. Current literature approaches designed interpret do not explain features’ contributions well since they focus only or consider relationship among features. This paper presents ClusterShapley address problems, generate explanations and algorithms a cluster-oriented explains formation clusters meaning their relationship, which is useful for domains. We propose novel visualization guide clustering validate our methodology case studies publicly available The demonstrate approach’s interpretability power insights about pathologies patients different conditions results.
منابع مشابه
Explaining three-dimensional dimensionality reduction plots
Understanding three-dimensional projections created by dimensionality reduction from high-variate datasets is very challenging. In particular, classical three-dimensional scatterplots used to display such projections do not explicitly show the relations between the projected points, the viewpoint used to visualize the projection, and the original data variables. To explore and explain such rela...
متن کاملDimensionality reduction with missing values imputation
In this study, we propose a new statical approach for high-dimensionality reduction of heterogenous data that limits the curse of dimensionality and deals with missing values. To handle these latter, we propose to use the Random Forest imputation’s method. The main purpose here is to extract useful information and so reducing the search space to facilitate the data exploration process. Several ...
متن کاملDimensionality Reduction Using a Randomized Projection Algorithm: Preliminary Results
We describe an implementation and experiments with a low-distortion randomized projection algorithm [LINI94] that can reduce the number of dimensions in the data by a considerable amount. The performance of the randomized algorithm is compared with that of a popular technique---Principal Component Analysis (PCA). The experiments show that the randomized projection algorithm consistently outperf...
متن کاملWeighted Shapley levels values
This paper presents a collection of four different classes of weighted Shapley levels values. All classes contain generalisations of the weighted Shapley values to cooperative games with a level structure. The first class is an upgrade of the weighted Shapley levels value in Gómez-Rúa and Vidal-Puga (2011), who use the size of components as weights. The following classes contain payoff vectors ...
متن کاملShapley Inconsistency Values
There are relatively few proposals for inconsistency measures for propositional belief bases. However inconsistency measures are potentially as important as information measures for artificial intelligence, and more generally for computer science. In particular, they can be useful to define various operators for belief revision, belief merging, and negotiation. The measures that have been propo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Expert Systems With Applications
سال: 2021
ISSN: ['1873-6793', '0957-4174']
DOI: https://doi.org/10.1016/j.eswa.2021.115020